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e Background and Aims The genus Rosa (150-200 species) is widely distributed throughout temperate and sub- 
tropical habitats from the northern hemisphere to tropical Asia, with only one tropical African species. In order to 
better understand the evolution of roses, this study examines infrageneric relationships with respect to conventional 
taxonomy, considers the extent of allopolyploidization and infers macroevolutionary processes that have led to the 
current distribution of the genus. 

e Methods Phylogenetic relationships among 101 species of the genus Rosa were reconstructed using sequences 
from the plastid psbA-trnH spacer, trnL intron, trnL-F spacer, trnS-G spacer and trnG intron, as well as from nu- 
clear glyceraldehyde 3-phosphate dehydrogenase (GAPDH), which was used to identify putative allopolyploids and 
infer their possible origins. Chloroplast phylogeny was used to estimate divergence times and reconstruct ancestral 
areas. 

e Key Results Most subgenera and sections defined by traditional taxonomy are not monophyletic. However, 
several clades are partly consistent with currently recognized sections. Allopolyploidy seems to have played an im- 
portant role in stabilizing intersectional hybrids. Biogeographic analyses suggest that Asia played a central role as a 
genetic reservoir in the evolution of the genus Rosa. 

e Conclusions The ancestral area reconstruction suggests that despite an early presence on the American continent, 
most extant American species are the results of a later re-colonization from Asia, probably through the Bering Land 
Bridge. The results suggest more recent exchanges between Asia and western North America than with eastern 
North America. The current distribution of roses from the Synstylae lineage in Europe is probably the result of a mi- 
gration from Asia approx. 30 million years ago, after the closure of the Turgai strait. Directions for a new sectional 
classification of the genus Rosa are proposed, and the analyses provide an evolutionary framework for future studies 
on this notoriously difficult genus. 
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INTRODUCTION 


The genus Rosa L. (roses; Rosoideae: Rosaceae) comprises 
about 150-200 species widely distributed throughout the tem- 
perate and sub-tropical habitats of the northern hemisphere 
(Rehder, 1940; Gu and Robertson, 2003), with the exception of 
one tropical African species. Approximately half of the rose 
species occur in Asia, while Europe and North America host 
approximately a quarter of the species each. The species of this 
genus are difficult to identify because of the homogeneity in 
morphology (see Fig. 1) associated with hybridization. 
Cultivated for >2000 years (Guoliang, 2003), roses are eco- 
nomically important as ornamental shrubs and cut flowers, as 
well as for perfumes, cosmetics and pharmaceutical research 
(see Cutler, 2003; and for recent pharmaceutical research, see 
Jager et al., 2007; Yi et al., 2007; Guimaraes er al., 2010). 
Moreover, numerous traits (small nuclear genome, extensive 
cross-species fertility and advanced industrial horticultural and 
micropropagation techniques) as well as their close affinity 


with several important woody Rosaceae crop species (e.g. rasp- 
berries, apples, almonds, cherries and peaches) suggest that 
roses could provide an ideal model for exploring woody plant 
genomes (see Bruneau et al., 2007; Debener and Linde, 2009). 
Roses have captured the interest of scientists in various mod- 
ern genetic fields (e.g. quantitative genetics and functional ge- 
nomics; see Debener, 2009). However, these studies have been 
conducted within the framework of a classification that is prob- 
ably obsolete because the most recent taxonomic treatment 
(Wissemann, 2003) still relies largely on the sub-divisions 
made >70 years ago (Rehder, 1940), and that are themselves 
adapted from 19th century arrangements (Crépin, 1889, 1891). 
Wissemann’s (2003) system divides the genus Rosa into four 
subgenera [R. subgen. Rosa, R. subgen. Hulthemia (Dumort.) 
Focke, R. subgen. Platyrhodon (Hurst) Rheder and R. subgen. 
Hesperhodos Cockerell] and the main subgenus Rosa into ten 
sections [R. sect. Pimpinellifoliae (DC.) Ser., R. sect. Rosa, 
R. sect. Caninae (DC.) Ser., R. sect. Carolinae Crép., 
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Fic. 1. Morphological diversity of flowers and fruits of a few representatives of Rosa. (A) Flower of Rosa hugonis. (B) Flower of R. moyesii. (C) Flower of R. multi- 

bracteata. (D) Flower of R. odorata. (E) Flower of R. praelucens. (F) Flower of R. prattii. (G) Flower of R. tsinglingensis. (H) Flower of R. roxburghii. (I) Flower of 

R. rubus. (J) Flower of R. roxburghii. (K) Flower of R. villosa. (L) Flower of R. primula. (M) Fruit of R. sikangensis. (N) Fruits of R. omeiensis. (O) Fruit of 
R. mairei. (P) Fruit of R. macrophylla. (Q) Fruits of R. sweginzowii. 
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R. sect. Cinnamomeae (DC.) Ser., R. sect. Synstylae DC., R. 
sect. Indicae Thory, R. sect. Banksianae Lindl., R. sect. 
Laevigatae Thory and R. sect. Bracteatae Thory], all subgenera 
and sections being identical to those of Rehder (1940). In addi- 
tion, Wissemann (2003) defined six new sub-sections in 
R. sect. Caninae (R. sub-sect. Trachyphyllae H. Christ, R. sub- 
sect. Rubrifoliae Crép., R. sub-sect. Vestitae H. Christ, R. sub- 
sect. Rubiginae H. Christ, R. sub-sect. Tomentellae H. Christ 
and R. sub-sect. Caninae). 

There have been numerous attempts at building a phylogeny 
that would give a new perspective on the genus [rapidly ampli- 
fied polymorphic DNA (RAPD) data, Debbener er al/., 1996; 
Millan et al., 1996; Jan et al., 1999; DNA sequences, 
Matsumoto et al., 1998, 2000, 2001; Iwata et al., 2000; Wu 
et al., 2000, 2001; Wissemann and Ritz, 2005; Bruneau ef al., 
2007; Qiu, 2012; microsatellite analyses, Scariot ef al., 2006; 
amplified fragment length polymorphism (AFLP) data, 
Koopman ef al., 2008]. However, these studies are contradic- 
tory and only a few of them support the monophyly of 
Wissemann’s sections. These previous studies faced numerous 
problems. First, in most of these studies, phylogenetic resolu- 
tion is poor and, where clades are resolved, support is often 
weak. This is explained partly by the extremely low levels of 
sequence divergence observed across the genus (e.g. 
Matsumoto et al., 1998; Wissemann and Ritz, 2005). Secondly, 
hybridization complicates phylogeny reconstruction in roses. 
Several studies have confirmed that interspecific hybridization 
is frequent in the genus (Ritz er al., 2005; Joly and Bruneau, 
2006; Joly et al., 2006; Schanzer and Vagina, 2007; Mercure 
and Bruneau, 2008; Schanzer and Kutlunina, 2010; Ritz and 
Wissemann, 2011; Kellner et al., 2012b). Indeed, Wissemann 
and Ritz (2005) noted several contradictions between their plas- 
tid and nuclear gene phylogenies. Thirdly, identification can 
also be problematic because Rosa taxonomy is further compli- 
cated by the publication of numerous names given to morpho- 
logical variants and hybrids (Wissemann, 2003). Problems with 
the identification of plant material or sequencing of hybrids 
could explain why conspecific samples sometimes fall into dis- 
tinct clades in some studies (see Bruneau ef al., 2007). 
Fourthly, sampling has often been incomplete and biased to- 
ward cultivated varieties or specific geographic areas. When 
several geographic areas were represented, usually only one of 
them was effectively represented by wild-collected samples, 
and the rest were represented by garden-grown specimens. 
Given the strong ability of roses to hybridize, the use of garden- 
grown specimens is questionable unless a wild origin is clearly 
established. 

In this study, we present a robust molecular phylogeny of the 
genus Rosa to provide the genus-wide perspective necessary to 
determine the origins of wild roses and to orient future studies 
on Rosa properly (floral evolution, traits genomics, conserva- 
tion studies, rose breeding, etc.). Wild samples from Asia, 
Europe and North America were collected and were supple- 
mented by a few herbarium or garden-grown samples (usually 
from wild origin). We use sequences from the chloroplast trnL 
intron and trnL-F and psbA-trnH intergenic spacers that have 
been found to be relatively variable in Rosa (Bruneau ef al., 
2007), but also include sequences from the trnS-G spacer and 
trnG intron, which Shaw ef al. (2007) suggested were highly in- 
formative regions. The use of chloroplast data seems to be 
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appropriate in Rosa to draw a first phylogenetic hypothesis 
without taking into account reticulate evolution, but we also 
present phylogenetic relationships obtained with the nuclear 
cytosolic glyceraldehyde §3-phosphate dehydrogenase 
(GAPDH) gene to highlight evidence for reticulate evolution in 
the genus. Finally, we infer a biogeographic history of the 
genus and draw general conclusions on our current knowledge 
of Rosa phylogeny in regards to the currently accepted 
taxonomy. 


MATERIALS AND METHODS 
Taxon sampling 


A total of 101 rose species are represented in our sampling, cor- 
responding to more than half of the species reported in the ge- 
nus (Supplementary Data S1). At least one species from each of 
the subgenera and each of the sections belonging to the genus 
Rosa were sampled. Our sampling includes North American, 
European and Asian species, as well as the African species 
R. abyssinica R. Br. Most samples were collected in the field, 
and material was preserved in silica gel for molecular analyses 
while vouchers were deposited in herbaria. This fairly complete 
sampling of samples collected in natural habitats was supple- 
mented by samples obtained from plants cultivated in gardens 
(from wild origin) or from herbarium specimens. When possi- 
ble, we avoided garden-grown specimens obtained from seeds 
because they are more likely to have undergone hybridization. 

Sequences of the trnL region (including the trnL-F spacer 
and trnL intron) and the psbA-trnH intergenic spacer were gen- 
erated for 107 samples of roses, and these were added to the 50 
sequences published by Bruneau ef al. (2007) for 25 more sam- 
ples. For the chloroplast t7nG region (including the t7nG intron 
and the trnG-trnS spacer), a total of 129 sequences were gener- 
ated and included in the analyses. Sequences of the three re- 
gions generated for Rubus biflorus were used as the outgroup. 
Most samples included were sequenced for the three different 
regions. However, three samples (one for R. foetida Herrm. and 
two for R. minutifolia Engelm.) are represented only by two se- 
quences because the t7mG region sequences could not be ob- 
tained. In addition, a sub-set of 55 of the 101 species were also 
sequenced for the nuclear cytosolic GAPDH gene. The 
GAPDH gene was amplified from the end of exon 7 [according 
to the Arabidopsis thaliana (L.) Heynh. sequence; GenBank lo- 
cus tag: At3¢04120] to the beginning of exon 11 (which is exon 
9 in A. thaliana; see Joly et al., 2006). Each of the 55 samples 
sequenced was represented by 1—5 sequences. We also included 
sequences previously published by Joly et al. (2006) and Meng 
et al. (2011) for 14 taxa. New sequences were submitted to 
GenBank, and accession numbers are given in Supplementary 
Data S1. 


Molecular methods 


DNA was extracted either using a modification of the Doyle 
and Doyle (1987) cetyltrimethylammonium bromide (CTAB) 
protocol or, notably for herbarium specimens, using the 
Tiangen Plant Genomic DNA Kit (Beijing) following instruc- 
tions from the manufacturer. The PCR amplification mix 
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contained 4—5 U of Tag DNA polymerase, Taq DNA polymer- 
ase buffer (Tiangen, Beijing) with 2mm MgCl, 200 um of each 
dNTP and 0-4 um of each primer. Amplifications were con- 
ducted using a PTC-0200 thermocycler (Bio-Rad, Beijing). 
Conditions for amplification of the trnL region were 3 min of 
initial denaturation at 94 °C, followed by 40 cycles of 30s at 
94 °C, 30s at 47-5 °C and | min30s at 72 °C, with a final step 
of 7min at 72 °C. Similar conditions were used for the other 
three regions, except that the annealing temperature was 
52:5 °C for the psbA-trnH intergenic spacer, 54 °C for the trnG 
region and 49 °C for the GAPDH region, and the elongation 
time was 30s for the psbA-trnH intergenic spacer and the trnG 
region and 2min for the GAPDH region. For the GAPDH 
region, when multiple copies were suspected (presence of dou- 
ble peaks in direct sequencing reads), cloning was performed. 
PCR products were cloned in pMD19-T vector (TaKaRa 
Biotechnology, Dalian, China) following the instructions of the 
manufacturer. Colonies were screened by PCR and selected 
colonies were incubated overnight in LB broth with appropriate 
antibiotics. Two (for diploids) to 18 clones (for pentaploids) 
were selected depending on the ploidy level reported in the spe- 
cies. PCR products or clones were sent to Invitrogen 
(Shanghai) for purification and sequencing. 

The ¢rnL intron and trnL-F spacer were amplified using the 
‘c’ and ‘f’ primer pair (Taberlet er a/., 1991) then sequenced us- 
ing the amplification pair as well as internal primers ‘d’ and ‘e’ 
as described by Taberlet et al. (1991). The psbA-trnH spacer 
was amplified and sequenced using “psbAF’ and ‘trnHR’ as de- 
scribed by Sang ef al. (1997). The trnS-G spacer and trnG in- 
tron were amplified using the ‘trnS’ and ‘trnG’ primer pair as 
described by Shaw ef al. (2007) then sequenced using those 
two primers and internal primers ‘trnG2S’ and ‘trnG2G’, as de- 
scribed by Shaw ef al. (2005). The GAPDH region was ampli- 
fied using GPDX7F (Strand ef al., 1997) and GPDX11R (Joly 
et al., 2006). The strands were combined and edited using 
Sequencher (version 4.14, Gene Codes Corporation, Ann 
Arbor, MI, USA). 


Phylogenetic analyses 


Sequence alignments were performed using Clustal W 
(Larkin et al., 2007) with the default parameters as imple- 
mented in MEGA 4 (Tamura ef al., 2007). Alignments were 
then verified and modified manually where inconsistencies 
were found. The alignments for the trnL and GAPDH were 
straightforward, but for the psbA-trnH spacer and trnG regions 
homology was difficult to assess in three highly repetitive re- 
gions that were subsequently removed from the analyses 
(182 bp of aligned sequences in the psbA-trnH spacer and 65 
plus 50 bp of aligned sequences in the t7nG regions). 

Phylogenetic analyses of the concatenated chloroplast se- 
quence data were performed under Maximum Likelihood opti- 
mization using RAxML 7.2.7 on CIPRES Science Gateway 
(Stamatakis, 2006; Stamatakis et al., 2008) with separate 
GTR+T° substitution models for each region and the fast 
bootstrap option using 1000 replicates. Bootstrap values were 
considered low when strictly inferior to 65 %, moderate 
between 65 and 80 % and strong when superior to 80 %. 
The plastid matrices are available in TreeBASE (http://purl.org/ 
phylo/treebase/phylows/study/TB2:S 15526). 


In vitro recombination of DNA sequences is a problem when 
cloning products of PCRs in which multiple alleles or paralo- 
gous copies have been amplified (Cronn ef al., 2002; Russell 
et al., 2010). For this reason, we screened the sequences and 
eliminated chimeric sequences as described in Russell er al. 
(2010). For polyploids, single nucleotide substitutions [single 
nucleotide polymorphisms (SNPs)] were considered as PCR er- 
rors when not shared by several clones. For diploids, SNPs 
were considered as PCR errors when no double peaks were pre- 
sent in direct sequencing. Nuclear analyses were performed us- 
ing the NeighborNet algorithm implemented in SplitsTree 
4.13.1 (Huson and Bryant, 2006) to reconstruct a network that 
would summarize better than a tree the complex relationships 
between the different copies of GAPDH present in diploids and 
polypploids. 


Divergence time analyses 


Because of the complex nature of nuclear results and because 
we did not want to exclude the numerous polyploid species in 
the genus, we used the plastid sequence data with greater taxon 
sampling to perform biogeographical analyses. Preliminary 
analyses using a relaxed lognormal clock as implemented in 
BEAST 1.7.3 (Drummond ef al., 2006; Drummond and 
Rambaut, 2007) failed to converge. A likelihood ratio test per- 
formed on the three plastid regions independently failed to re- 
ject the molecular clock hypothesis. As Brown and Yang 
(2011) suggested that a strict molecular clock could perform 
well when rate variation is low and that the likelihood ratio test 
was suitable to test the strict molecular clock hypothesis, diver- 
gence time analyses were conducted using a strict molecular 
clock as implemented in BEAST 1.7.3. The best partition 
scheme and the substitution models for each partition were se- 
lected using PartitionFinder (Lanfear et al., 2012). 
Consequently, the TVM + G, TVM + G and TVM + 1+ G sub- 
stitution models were used for the psbA-trnH spacer, the trnL 
region and the trnG region, respectively, with a Birth Death 
process tree prior. Markov chain Monte Carlo (MCMC) runs 
were extended for 20 million generations (burn-in 10 %), with 
parameters and trees sampled every 1000 generations. 
Convergence was assessed using Tracer v1.5 (Rambaut ef ail., 
2013) and then the maximum clade credibility tree was selected 
using TreeAnnotator from the BEAST package. 

We applied two calibration points. The first calibration point 
was the stem node of the genus Rosa. The oldest known fossils 
in the genus are R. germerensis (Edelman, 1975) from the 
Germer Tuffaceous member, Challis volcanic formation 
[Idaho, 55-8—48-6 million years ago (Ma)] and R. hilliae Lesq. 
(Anonymous, 1978) from the Jijuntun formation (Fushun, 
51-45 Ma; Meng ef al., 2012). To accommodate those ages, we 
used a normal prior distribution centred on 50-5Ma and a 
97-5 % confidence interval between 55 and 45 Ma. The second 
calibration point was the stem node of the Synstylae and allies 
(see the Results). A recent study by Kellner et al. (2012a) re- 
ported that some Rosa lignitum Heer fossils (30-44 + 1-52 Ma 
of age) exhibit a particular semi-craspedodromous leaf venation 
pattern that seem to be restricted to extant species of our 
Synstylae and allies clade. This character could be an adaptation 
to warmer climate but, first, our own observations suggest that 
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this adaptation to warmer climate is also restricted to the 
Synstylae and allies clade and, secondly, Kellner et al. (2012a) 
showed that R. stellata Wooton (R. subgen. Hesperhodos) lacks 
this particular leaf venation while being adapted to warm cli- 
mate. Other fossils from Colorado (Florissant) around the same 
epoch also lack this particular leaf venation (see Becker, 1963). 
The separation of this clade from the rest of the genus can be 
considered older than the first semi-craspedodromous fossil. 
The stem node of the Synstylae and allies clade was therefore 
constrained using a normal prior distribution centred on 
30:5 Ma with a 97-5 % confidence interval between 28-99 and 
32 Ma. 


Ancestral area reconstructions 


We performed ancestral area reconstruction analyses on the 
plastid chronogram obtained with BEAST. We used the dis- 
persal—extinction—cladogenesis (DEC; Ree and Smith, 2008) 
model as implemented in RASP 2.1b (Yu ef al., 2013). It uses 
the information contained in genetic branch lengths and allows 
the incorporation of changing dispersal probabilities across area 
and time. Areas were defined as described below. North 
America included Canada, the USA and northern Mexico. The 
southern boundary for Rosa distribution in North America is 
Baja California, so Central and South America are not consid- 
ered. Africa included sub-Saharan Africa only. A few species 
occur in northern Africa (Atlas Mountains) and are also distrib- 
uted in Europe. Similarly, the Northern part of the Arabian 
Peninsula is also included in the European distribution because 
only species distributed otherwise in the rest of Europe were 
found. The limit between Europe and Asia was drawn on the 
Caspian Sea, and Kazakhstan and Iran are considered as parts 
of Asia. Russia is divided into two parts: one Asian (Siberia 
and Far East) and one European (for the rest of the country). 
The maximum number of areas at each node was set to four be- 
cause the most widespread species in the genus occur in four of 
our areas. We tested different time-slice models (one to four, 
see Supplementary Data S2), which yielded identical results. 
Fossil distribution information was alternatively excluded or in- 
cluded. Early Rosa fossils from China and the USA were con- 
sidered to reconstruct the ancestral area for the genus. The 
presence of R. lignitum with a semi-craspedodromous venation 
in Europe during the Oligocene provides evidence for the pres- 
ence of the Synstylae and allies in this area, but the fossils from 
the other areas have not been examined for semi-craspedodro- 
mous venation. It is thus difficult to know whether the 
Synstylae and allies were genuinely absent outside of Europe. 
Because the ancestral area reconstruction (AAR) excluding fos- 
sil information suggested Asia to be the ancestral area for the 
Synstylae and allies clade, we considered this clade to be pre- 
sent in both Europe and Asia. 


RESULTS 
Plastid DNA phylogeny 


The plastid DNA analyses (Fig. 2) suggest that R. subgen. 
Rosa is not monophyletic but instead resolved two main clades: 
the Cinnamomeae clade and the Synstylae clade. 
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The Cinnamomeae clade is resolved with low support (55 %), 
and includes all members of R. sects Cinnamomeae and 
Carolinae (R. subgen. Rosa), as well as a few species from 
other sections (three from R. sect. Pimpinellifoliae species and 
one from R. sect. Synstylae) and one species from R. subgen. 
Platyrhodon. The Cinnamomeae clade includes a strongly sup- 
ported clade (97 %) containing almost all American species of 
R. sects Cinnamomeae and Carolinae (except R. nutkana 
C. Presl var. hispida Fernald), the two European species of 
R. sect. Cinnamomeae and some Asian species. The 
Cinnamomeae clade together with the Pimpinnelifoliae clade 
and R. subgen. Hesperhodos and Hulthemia form an unsup- 
ported clade, and will be designated by the name Cinnamomeae 
and allies. The Synstylae clade is resolved with moderate sup- 
port (76 %) and comprises nearly all members of R. sect. 
Synstylae and all members of R. sects Indicae, Caninae and 
Rosa. Within the Synstylae clade, a clade containing all the 
Asian species of R. sects Synstylae and Indicae is resolved with 
strong support (88 %) and another clade containing the other 
species of R. sect. Synstylae (from America and Europe) as 
well as R. gallica L. (monotypic R. sect. Rosa) and all members 
of R. sect. Caninae is resolved with strong support (89 %). 
Rosa sect. Caninae itself is not resolved as monophyletic and is 
divided into two clades. The first clade, the Caninae clade, 
comprises the members of R. sub-sect. Caninae with low sup- 
port (53 %) but is strongly supported (93 %) as closely related 
to other European roses from the Synstylae clade. The second 
clade, the Rubigineae clade, gathers the members of the four 
other sub-sections with strong support (93 %). A poorly sup- 
ported clade named Synstylae and allies includes the Synstylae 
clade and members of R. sects Laevigatae, Bracteatae and 
Banksianae as well as R. subgen. Platyrhodon, with no resolu- 
tion within this larger clade. 


Nuclear analyses 


The NeighborNet analyses of the GAPDH nuclear sequences 
(Fig. 3) suggest that sequences from R. roxburghii Tratt. 
(R. subgen. Platyrhodon) and those from R. sect. Bracteatae, 
R. sect. Banksianae and R. sect. Laevigatae, as well as those 
from R. sect. Pimpinellifoliae are each well differentiated from 
each other and also from the rest of the genus. Moreover, se- 
quences from Cinnamomeae and Synstylae clade members 
seem to have diversified independently except that the R. gym- 
nocarpa Nutt. (R. sect. Cinnamomeae) GAPDH sequence is 
oddly placed close to those of the Synstylae clade. Other se- 
quences available in GenBank for this species are not included 
in the analysis but gave the same result (not shown for the sake 
of clarity). Inside the Cinnamomeae group, two groups can 
be distinguished and we designate them as Cl and C2 types 
(see Fig. 3). The relationships between the Cinnamomeae 
clade sequences and the Synstylae clade sequences are very 
complex. The Cl and C2 groups are not supported by the 
chloroplast data, and some polyploids (Rosa laxa Retz. and 
R. pendulina L.) have copies from both groups. Members of 
R. sect. Caninae seem to possess a complex genome including 
alleles related to the Synstylae group and alleles related to 
the Cinnamomeae group. The alleles from R. sect. Caninae 
related to the Cinnamomeae group are divided into the two Cl 
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R. helenae 3 2x 
— R. brunonii 2x 
r— R. multiflora 1, 2, 3, 4x 
R. anemoniflora 
62] R. luciae var. luciae 2x 
A. transmorrisonensis 
R. henryii 2x 
R. luciae var. fujisanensis_ 2x 
R. soulieanat 2x 
gg), Ff. soulieana2 2x 
R. soulieana4d 2x 
— RR. SOUNCANA 32K evnsmmapemngrnnn 
-— R. chinensis var. spontanea 2,3, 4x. 
L} R. longicuspis 2x 
B ee R. rubus 1 
g7 A. rubus 2 
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R.cymosa2 2x 
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t— R.cymosa1 2x 
R.cymosa3 2x 
R. cymosa 4, 2X. 
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R. xanthina 2x 


95'"———— R. hugonis 2x 
R. taronensis 
57|R. omeiensis 2x 


R. sikangensis 
R. sericead 2x 
95) R. sericea3 2x 


ff 
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R. sericea! 2x 

g4| 7. sericea 2x 
'— R. sericea2 2x 

— R. mairei. 


Rv abyssinica 


R. tsinglingensis 
100'R. farreri, 2x i 
R. praelucens 10x 7 


2, 3x 
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100! A. multiflora 2. 2,3, 4x 
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SS RT OT 
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R. moyesii 2 476x"™~ 
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59 


R. prattii2 2x 

R. sweginzowii2 6x 
R. moyesii1 4, 6x 
R. prattii1 2x 

R. setipoda 4, 6x 
t— A. forrestiana 2x 
— R. macrophylla 1 


R. davidii 2 


R. giraldii 2 
R. saturata 


50] 


C &allies 


2, 4x 
R. nutkana var. hispida... 6X... 
[— R. spinosissima 1” 4x 
R. spinosissima2.. 4x. 
R. fedischenkoana 
R. multibracteata 


“Ay 6x" 
2, 4x 
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R. sweginzowii 1 6x 


4x 

R. woodsii var. woodsii 2x 
69] 7 yainacensis 4x 

r— R. pisocarpa 

R. californica 

75 50] R. brownii 

R. nutkana var. nutkana 6x 

R. woodsii var. ultramontana......2X. 

palustris 2x 


2, 3x 
4, 6x 


La 
64 R. 


6a F- blanda.....2x. 
R. nitida 2x 
— R. arkansana.,..4Xx. 


83 R. foliolosa. 2X 
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86 R. rugosa 2x 

R. bella 4x 

75| 7]8. willmottiae 2 2, 5x 
R.davurical 2, 4,6x 

R.davurica2 2, 4, 6x 

R. majalis 2, 3, 4, 6x 


63] 96, 


R. granulata 

R. bridgesii 

— R. lesterae 

R.gymnocarpa 2x 


99 
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———————— R. pinetorum 
96;— R.laxa1 4x 

t— R. albertii1 

—R.laxa2 4x 


R. acicularis, 


R.pendulina 2,4, 5, 7, 8x 
t— R. macrophylla 22, 4x. 
89}_ koreana vox ae 

R. beggeriana "2; 6x 

r— R. banksiopsis 

64] R. daviddi1 4, 6x 

R. murielae 

jiraldii 1 


R. corymbulosa 
R. willmottiae 1 2, 5x 
'‘— R. willmottiae3 2, 5x. 
2x3, 4,6, 7, 8X 


R. foetida. 4x” 


R. berberifolia . 
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Fic. 2. Phylogenetic relationships among Rosa species as reconstructed by Maximum Likelihood analyses of three chloroplast regions (psbA-trnH spacer, trnL region 

and trnG region). Bootstrap values are placed as close as possible to the node supported. The ploidy level of each species is given after its name (see Erlanson, 1929, 

1934, 1938; Roberts, 1977; Yokoya er al., 2000; Roberts et al., 2009; Jian et al., 2010). The names of known polyploids are in bold (in R. sect. Caninae all species 

are presumed to be polyploids even when the ploidy number is not exactly known). Wissemann’s (2003) classification is compared with our clades. A P designates 
our Pimpinellifoliae clade, a C our Cinnamomeae clade, an S our Synstylae clade, Ca our Caninae clade and Ru our Rubiginae clade. 
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0-01 


Cinnamomeae 


Pimpinellifoliae 


Eitype 


Synstylae 


‘S type 


Fic. 3. Network representing the relationships among copies of GAPDH obtained from Rosa species. A ‘c’ followed by a number indicates the number attributed to 

one particular clone sequenced. The groups are compared with our main clades from the chloroplast analyses. Purple is attributed to Rosa subgen. Platyrhodon, yel- 

low to R. sect. Banksianae, bright orange to R. sect. Bracteatae, brown to R. sect. Laevigatae, light pink to Pimpinellifoliae clade, green to Cinnamomeae clade, light 

blue to Synstylae clade and a deeper blue to R. sect. Caninae. Two species have a particular colour, R. spinosissima is highlighted with a deeper pink colour and 

R. abyssinica with a light orange colour. Two types of copies, C1 and C2, are distinguished in our Cinnamomeae group. Some polyploids have several copies with 

different affinities. The names of known polyploids are in bold (in R. sect. Caninae all species are presumed to be polyploids even when the ploidy number is not 
exactly known). 
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TaBLe 1. Composition of the Caninae genome for some of the 
species studied 


Sub-section Species Ploidy level GAPDH copies Chloroplast 


Caninae R.montana ? SSC1C1C2 Caninae, Synstylae 
Caninae R. canina 4,5 or6 SSC1C1? Caninae, Synstylae 
Caninae R. caesia 5 or 6 SSCIC1? Caninae, Synstylae 
Rubiginae R.elliptica ? SSC1C2? Rubiginae, Synstylae 
Tomentosae R. villosa 4or5 22?C1C1C2 Rubiginae, Synstylae 


Letters in the ‘GAPDH copies’ column correspond to the different types of 
GAPDH copies as defined in Fig. 3 (S for Synstylae type; C1 and C2 for two 
different groups of Cinnamomeae type). Question marks in this column indi- 
cate that some copies may not have been recovered. 

In the ‘Chloroplast’ columns, the phylogenetic origin is described according 
to clades defined in the chloroplast phylogeny (Fig. 2). Ploidy levels are de- 
tailed in Roberts et al. (2009). 


and C2 groups, indicating that the genome from members 
of R. sect. Caninae contains three kinds of alleles (see 
Table 1). Despite the high number of clones sequenced, it was 
not always possible to recover all the alleles expected. 
Sometimes one of the alleles was more frequently sequenced 
than the other (nine of 13 clones sequenced were the same in 
R. villosa L.). 


Biogeographic analyses 


The plastid analyses suggest that the Synstylae and allies and 
the Cinnamomeae and allies are characterized by contrasting 
geographic patterns (Fig. 4). Members of Cinnamomeae and 
allies are mostly Asian and American (only one African spe- 
cies, two European species and two widely distributed) and 
neither of these two geographic origins is monophyletic, imply- 
ing multiple dispersal events. Members of Synstylae and 
allies occur mostly in Asia and Europe (only one American 
species) and the European origin is monophyletic while Asian 
species are paraphyletic. The Pimpinellifolicae clade and 
most of the species from early diverging lineages (R. sects 
Pimpinellifolieae, Hulthemia,  Platyrhodon, — Bracteatae, 
Laevigatae and Banksianae) are Asian, except R. subgen. 
Hesperhodos. Our ancestral area reconstruction including fossil 
information suggests that early distribution of the genus in- 
cluded Asia and America. Combined with the divergence time 
analyses, our results also suggest that the Synstylae lineage and 
its allies extended their distribution from Asia to Europe around 
30-1 Ma and then that part of the lineage reached eastern North 
America around 17-4 Ma. The exchanges between Asia and the 
rest of the range were interrupted at 13-1 Ma and the exchanges 
between Europe and eastern North America persisted until 
8-4 Ma. Despite the early presence of the genus Rosa on the 
American continent, the ancestor of the Pimpinellifoliae and 
Cinnamomeae clade was strictly Asian. This means that among 
the extant species of American roses, only the species of R. sub- 
gen. Hesperhodos results from this ancestral widespread distri- 
bution while the other American species result from a later (at 
13-4 Ma) re-colonization from Asia. Exchanges between west- 
ern North America and eastern North America seem to persist 
even today. Exchanges between eastern North America and 
Asia were interrupted at 5-3 Ma but exchanges between western 


North America and Asia lasted longer and were finally inter- 
rupted at 4-1 Ma. Disjunctions between eastern Asia and eastern 
North America are not represented in our results. Instead, east- 
ern North American species seem to be closely related to west- 
ern North American or European species. The African R. 
abyssinica seems to have captured its chloroplast genome from 
Cinnamomeae between 9-3 and 69Ma. European 
Cinnamomeae species seem to be the results of colonization 
from Asia between 2-5 and 0-6 Ma. 


DISCUSSION 


Our study has generated the most comprehensively sampled 
and well-resolved phylogeny of the genus Rosa to date, al- 
though support for the deeper nodes of the phylogeny remains 
low. Some clades roughly corresponding to sections described 
by Wissemann (2003) are supported. Although some relation- 
ships obtained here are consistent with previous studies, others 
are new or inconsistent with previous results (see Table 2 for 
example) and these inconsistencies are likely to be the result of 
hybridization. 


Subgenus status 


The two arid-adapted subgenera (R. subgen. Hulthemia and 
Hesperhodos) are suggested as closely related despite low clade 
support, a relationship previously suggested by Wissemann and 
Ritz (2005). 

Rosa subgen. Platyrhodon (R. roxburghii and R. praelucens 
Bijh.) is not resolved as monophyletic in our plastid analyses. 
While R. roxburghii is resolved as an independent lineage in 
both our plastid and nuclear analyses, R. praelucens is sup- 
ported as a member of the Cinnamomeae clade in our plastid 
analyses. We consider the diploid R. roxburghii as a typical 
member of R. subgen. Platyrhodon. However, because allo- 
polyploids have often been reported in the genus (Joly and 
Bruneau, 2006; Joly et al., 2006; Mercure and Bruneau, 2008; 
Schanzer and Kutlunina, 2010), we suspect that the decaploid 
R. praelucens (Jian et al., 2010) is an allopolyploid resulting 
from a hybridization event (or multiple hybridization events) 
involving the diploid R. roxburghii and at least one member of 
the Cinnamomeae clade. Based on karyomorphology, Jian et al. 
(2010) also suggested that R. praelucens is of allopolyploid ori- 
gin, and a study in progress using GAPDH sequences supports 
the same conclusion (X. F. Gao, Chengdu Institute of Biology, 
CAS, Chengdu, China, pers. commun.). 

Wissemann’s (2003) classification, based on morphology, di- 
vides the genus Rosa into four subgenera (R. subgen. 
Hesperhodos, Platyrhodon, Hulthemia and Rosa), but to date 
most phylogenetic studies (Matsumoto et al., 1998; Jan et al., 
1999; Wu et al., 2000, 2001; Wissemann and Ritz, 2005; 
Bruneau ef al., 2007; Koopman ef al., 2008) have failed to 
recover a monophyletic R. subgen. Rosa. For this reason, 
Wissemann and Ritz (2005) suggested that the four subgenera 
would be best treated at the sectional level. Similarly, our 
results suggest that R. subgen. Rosa is not monophyletic with 
R. subgen. Platyrhodon as a member of the Synstylae and 
allies clade. Despite low support, the occurrence of a semi- 
craspedodromous venation in R. subgen. Platyrhodon and other 
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Rubus biflorus 
A. setigera 
R. gallica 
A. arvensis 
R. corymbifera 
R. vosagiaca 
R. x chavinii 
R. moutana 
R. subcanina 
R. caesia 
R. nitidula 
R. subcollina 
R. obtusifolia 
R. canina 

. sempervirens 
R. villosa 
R. sherardii 
R. jundzillii 
R. pouzinii 
R. elliptica 
R. micrantha 
R. rubiginosa 
R. agrestis 
R. cf. sicula 
R. glauca 1 
R. glauca 2 
R. helenae 3 
R. helenae 2 
R. helenae 4 
R. filipes 
R. helenae 1 
R. brunonii 
R. odorata v. g. 

iciae v. f. 

R. multiflora 1 
A. luciae var. luciae 
R. transmorrisonensis 
R. henryii 
R. anemoniflora 
R. rubus 1 
R. rubus 2 
R. chinensis v. s. 
R. longicuspis 
A. soulieana 1 
AR. soulieana 3 
AR. soulieana 4 
A. soulieana 2 
R. multiflora 2 
R. multiflora 3 
A. banksiae 
R. cymosa 3 

. cymosa 2 
R. cymosa 1 
R. cymosa 4 
R. laevigata 
A. bracteata 
R. roxburghii 
A. berberifolia 
A. minutifolia 1 
A. minutifolia 2 
R. foetida 
R. hugonis 
A. xanthina 
A. taronensi 
R. sikangensis 
R. omeiensis 
R. sericea 2 
R. sericea 1 
R. sericea 5 
R. sericea 3 
R. sericea 4 
R. mairei 
A. forrestiana 


R. prattii 2 
R. setipoda 

R. marcrophylla 

R. praelucens 

R. abyssinica 

A. farreri 

R. tsinglingensis 

R. aciculairs 

R. fedtschenkoana 
R. multibracteata 


R. giraldii 2 

R. nutkana var. h 
R. spinosissima 1 
R. spinosissima 2 
R. saturata 

R. willmottiae 3 
R. pendulina 

R. beggeriana 

A. koreana 

R. macrophylla 2 
R. corymbulosa 

R. willmottiae 1 
R. banksiopsis 

A. murielae 

R. giraldii 1 

R. davidii 1 


R. gymnocarpa 
R. pinetorum 
R. nutkana v. n. 
R. brownii 

R. californica 
R. yainacensis 
R. pisocarpa 
A. nitida 

A. blanda 

R. arkansana 
R. palustris 

R. carolina 

R. virginiana 
R. woodsii v. u 
R. woodsii v. w. 
A. foliolosa 

A. granulata 

R. lesterae 

A. bridgesii 

R. majalis 

R. rugosa 

R. davurica 2 
R. bella 

R. willmottiae 2 
R. davurica 1 
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Fic. 4. Chronogram obtained from BEAST analyses of chloroplast regions. Branches are coloured according to the ancestral area reconstruction analyses (DEC 
model) including fossils information. A P designates our Pimpinellifoliae clade, a C our Cinnamomeae clade and an S our Synstylae clade. The names of known 
polyploids are in bold (in R. sect. Caninae all species are presumed to be polyploids even when the ploidy number is not exactly known). 
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TABLE 2. Comparison of the recurrent Cinnamomeae and Synstylae clades from different studies 


Study 


Cinnamomeae clade 


Synstylae clade 


Zhang et al. (2013) 

SSR and flanking regions 
Bootstrap: no support 

Qiu et al. (2012) 

ITS and matkK 

Bootstrap: 79 and 83 resp. 
Meng et al. (2011) 
GAPDH 

Bootstrap: no and 56 
Koopman et al. (2008) 
AFLP; BA 

PP: 59 and no, respectively 
Koopman et al. (2008) 
AFLP; MP 

Bootstrap: no support 
Bruneau et al. (2007) 

trnL region, psbA-trnH 
Bootstrap: no support 

PP: 92 and 96, respectively 


Scariot et al. (2006) 


Cinnamomeae and Carolinae 

Some Caninae haplotypes 

R. pimpinellifolia 

Cinnamomeae* 

R. praelucens (subgen. Platyrhodon) 


Absent 
(paraphyletic) 


Cinnamomeae and Carolinae 


Cinnamomeae and Carolinae 
Most Caninae 

R. spinosissima 

Cinnamomeae and Carolinae 
R. koreana and R. spinosissima 
var. altaica (Pimpinellifoliae) 


Cinnamomeae and Carolinae 


Synstylae and Indicae 

Some Caninae haplotypes 

R. pimpinellifolia var. spinosissima 
Synstylae and Indicae 

R. alba (sect. Rosa) 


Synstylae and Indicae 
R. gallica (sect. Rosa) 


Synstylae* 
Caninae and sect. Rosa 


Synstylae* 
Some Caninae and sect. Rosa 


Synstylae and Indicae 
Caninae and sect. Rosa 
R. xanthina, R. bracteata, R. laevigata and R. cymosa 


Synstylae and Indicae 


SSR Pimpinellifoliae 
Bootstrap: no and 92 
Wissemann and Ritz (2005) 
atpB-rbcL 

PP: 83 and 70, respectively 
Wissemann and Ritz (2005) 


R. altaica 


ITS R. hugonis 
PP: 50 and no, respectively 

Wu et al. (2001) Absent 

ITS (polyphyletic) 


Bootstrap: no and 68 


Jan et al. (1999) 
RAPD 
Matsumoto et al. (1998) Cinnamomae and Carolinae 

matK R. spinosissima var. pimpinellifolia 
Bootstrap: 80 and 61, respectively 


Cinnamomae sister to Carolinae 


Cinnamomeae and Carolinae (both in part) 


Caninae and sect. Rosa 


Synstylae and Indicae 
Caninae and sect. Rosa 


Cinnamomeae (in part) and Carolinae and Bracteatae Synstylae (in part) and Indicae 


Caninae 


Synstylae only 


Synstylae sister to Indicae 


Synstylae and Indicae 
Caninae and sect. Rosa 
R. californica ‘Plena’ (Cinnamomae) 


For each study the type of data used and the supports are given when available (bootstrap or posterior probability, PP). 
AFLP, amplified fragment length polymorphism; BA, Bayesian analyses, GAPDH, glyceraldehyde 3-phosphate dehydrogenase; ITS, internal 
transcribed spacer; MP, Maximum Parsimony; RAPD, random amplified polymorphic DNA; SSR, simple sequence repeat. 


*Rosa sect. Carolinae not sampled. 
‘Rosa sect. Indicae not sampled. 


members of the Synstylae and allies clade further support 
this result. So the status of the four subgenera remains 
questionable. 


Wissemann’ s (2003) sections 


Our plastid sequence analyses resolved R. sect. Banksianae as 
monophyletic (both species sampled from wild origin) and 
closely related to but not embedded in the Synstylae clade. 
The nuclear GAPDH analyses confirm the close relationship of 
the two species and suggest that they are distinct from the 
Synstylae clade, as noted by other studies [Wu ef al., 2001, inter- 
nal transcribed spacer (ITS); Qiu ef al., 2012, ITS and matK]. 

Rosa sect. Bracteatae (one of two species sampled) and 
R. sect. Laevigatae (monospecific) are resolved as close relatives 
of the Synstylae clade, and our nuclear GAPDH sequences sug- 
gest that they form a distinct lineage. Several studies obtained 
results similar to ours for R. sect. Bracteatae (Matsumoto et al., 
1998; Wu et al., 2001), for R. sect. Laevigatae (Jan et al., 1999; 


chloroplast result only, in Wissemann and Ritz, 2005) or for the 
two sections (Qiu ef al., 2012). 

Regarding the other sections, our results are consistent with 
other studies in which the genus Rosa generally is resolved into 
two main clades (see Table 2), named Cinnamomeae and 
Synstylae clades in this study but, for example, named Clade I 
and Clade II in Bruneau ef al. (2007). Similar results are ob- 
tained with the analyses of the nuclear GAPDH sequences, sug- 
gesting that the two groups are distinct but the pattern of 
relationships within each group is complex, even when putative 
allopolyploids are removed from the analysis (Appendix). 

Both the plastid and nuclear analyses resolve R. sect. Indicae 
as embedded in R. sect. Synstylae. Similar relationships between 
these two sections have been reported in several studies (see 
Table 2: Matsumoto, 1998; Wissemann and Ritz, 2005; Scariot 
et al., 2006; Bruneau ef al., 2007; Meng ef al., 2011; Qiu et al., 
2012; Zhang et al., 2013), but other studies suggested that these 
two sections are independently monophyletic (RAPD — Millan 
et al., 1996; Jan et al., 1999; ITS —Matsumoto et al., 2000; 
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TABLE 3. Evolution of Pimpinellifoliae taxonomy and comparison with our results 
Rehder (1940) Roberts (1977) Wissemann (2003) Flora of China This study Species 
(Gu and Robertson, 2003) 
Pimpinellifoliae Pimpinellifoliae Pimpinellifoliae Pimpinellifoliae Pimpinellifoliae Clade R. omeiensis 
- - - - - R. sericea 
- - - - - R. hugonis 
- - - - - R. xanthina 
Not cited Not cited Not cited = - R. mairei 
- - - - - R. taronensis 
- - - - - R. sikangensis 
Pimpinellifoliae Pimpinellifoliae Pimpinellifoliae - Unresolved R. foetida 
- - - - Cinnamomae clade R. spinosissima 
Not cited Not cited Not cited - - R. tsinglingensis 
Pimpinellifoliae Cinnamomae - - - R. farreri 
- - Pimpinellifoliae - - R. koreana 


Wu ef al., 2001). Only one sample of each species of R. sect. 
Indicae, R. odorata (Andrews) Sweet and R. chinensis Jacq. 
were included here, but other samples of these two species 
yielded identical sequences to the samples included. They were 
not included because data were available for only one or two 
plastid regions. 

The chloroplast sequences of the two samples of the Asian 
R. multiflora Thunb. collected in China formed a clade while 
the third one collected in North America, where it is natural- 
ized, occurred in a clade with other Asian species of R. sect. 
Synstylae. The nuclear GAPDH sequences from another 
American sample were closely related to the sequences from 
our Asian sample. In North America, this species was intro- 
duced as rootstock for cultivated roses and propagated for soil 
conservation. It is possible that the individuals introduced were 
already horticultural hybrids and not pure R. multiflora. 

The only sub-Saharan species of the genus Rosa, R. abyssinica, 
is resolved by chloroplast data as embedded in the Cinnamomeae 
clade but clearly shows the synstyly consistent with R. sect. 
Synstylae. The nuclear GAPDH sequences resolve this species as 
closely related to R. gallica (R. sect. Rosa) and European species 
of R. sect. Synstylae, suggesting a hybrid origin. 

Within the Synstylae clade, a few members of R. sect. 
Synstylae from Europe and North America, the only member of 
R. sect. Rosa and all the species sampled in R. sect. Caninae 
form a sub-clade. The North American R. setigera Michx. (R. 
sect. Synstylae) is supported as sister to all the other species of 
this sub-clade, all of which are European, but GAPDH se- 
quences suggest it is more closely related to Asian species of R. 
sect. Synstylae. Species of R. sect. Synstylae of this clade (three 
species) are all diploids, while R. gallica (R. sect. Rosa) is a tet- 
raploid and most members of R. sect. Caninae for which the 
ploidy level is known are usually pentaploids. This ploidy level 
is rather unusual especially for species with a low level of apo- 
mixis compared with sexual reproduction (Wissemann and 
Hellwig, 1997; Werlemark et al., 1999; Werlemark, 2000; 
Werlemark and Nybom, 2001; Nybom ef al., 2004, 2006). 
Various authors (see Lim ef al., 2005) have described a peculiar 
heterogamous meiosis in species of R. sect. Caninae, which 
more or less maintains pentaploidy. This heterogamous meiosis 
produces tetraploid ovules and haploid pollen grains because 
two sets of chromosomes form bivalents and segregate while 
three sets remain as univalents. Similar asymmetrically 


compensating allopolyploids are described in Onosma 
L. (Kolar¢ik et al., 2014). 

Our results suggest that the genome of species in R. sect. 
Caninae is indeed complex, with two different kinds of chloro- 
plast genomes and possibly a nuclear genome with three dis- 
tinct origins (Figs 2 and 3; Table 1). Two chloroplast lineages 
within R. sect. Caninae were also reported by Wissemann and 
Ritz (2005), but they consider section Caninae as monophyletic 
based on the presence of a unique type of ITS sequences. The 
different sub-sections are not very well resolved by our phylog- 
eny but De Riek er al. (2013) managed to delineate three sub- 
sections using a different approach. Our analyses indicate that, 
in R. sect. Caninae, several copies of GAPDH originate from 
R. sect. Cinnamomeae while the other copies originate from 
R. sect. Synstylae (Fig. 3; Table 1), which is consistent with re- 
sults from microsatellite analyses (Zhang ef al., 2013) and ITS 
analyses (Ritz et al., 2005). 

The Pimpinellifoliae clade includes most species of R. sect. 
Pimpinellifoliae, but other species of this section occur in the 
Cinnamomeae clade or have an unresolved position (R. foe- 
tida). These results are similar to those of Matsumoto (2001) 
and Wissemann and Ritz (2005; chloroplast analysis). 

The species resolved in the Pimpinellifoliae clade all are 
Asian and diploids (or of unknown ploidy), with a consistent 
morphology (Table 4) that usually included them in R. sect. 
Pimpinellifoliae (see Table 3). Some diploid species (R. farreri 
Cox and R. koreana Kom.) that were previously included in 
R. sect. Pimpinellifoliae but are resolved in the Cinnamomeae 
clade are morphologically distinct from other members of the 
section by having bracts and sometimes pink or red flowers. 
These distinct species seem to be genetically close to R. sect. 
Cinnamomeae as suggested by the GAPDH analyses and 
should probably be transferred to R. sect. Cinnamomeae, as 
noted in the taxonomic revision by Roberts (1977). 

Interestingly, two other species that were included in R. sect. 
Pimpinellifoliae but that are not in the Pimpinellifoliae clade 
are polyploids or of unknown ploidy. These species have some 
morphological characters consistent with RR. _ sect. 
Pimpinellifoliae and some characters that are inconsistent with 
it (Table 4). We suspect that those species are allopolyploids. 
Our results from the nuclear GAPDH analyses (Fig. 3) confirm 
the presence of two different copies in R. spinosissima L. One 
of these copies is consistent with other members of the 
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TABLE 4. Comparative morphology of section Pimpinellifoliae (R. forrestiana from Cinnamomae is included for comparison) 


Position Species Ploidy level Bracts Petals Prickles Stipules margin Sepals Hip 
Pimpinellifoliae Clade R. omeiensis 2 Absent White Flat/terete Sinuous, enrolled Not leafy Globose 
R. sericea 2 Absent White Flat/terete Sinuous, enrolled Not leafy Globose 
R. hugonis 2 Absent Yellow Flat/terete Sinuous, enrolled Not leafy Globose 
R. xanthina 2 Absent Yellow Flat/terete Sinuous, enrolled Not leafy Globose 
R. mairei ? Absent White Flat/terete Sinuous, enrolled Not leafy Globose/ovoid 
R. taronensis ? Absent White/yellow Flat/terete Sinuous, enrolled Not leafy Globose 
R. sikangensis ? Absent White Flat/terete Sinuous, enrolled Not leafy Globose 
Unresolved R. foetida 4 Sometimes present Yellow Terete Sinuous, slightly enrolled Leafy Globose 
Cinnamomae Clade __ R. spinosissima 4 Absent Pink/white/yellow Terete Sinuous, enrolled Not leafy Globose 
R. tsinglingensis ? Sometimes present White Terete Entire, not enrolled Leafy Ovoid 
R. farreri 2 Present Pink/white Terete Entire, not enrolled Leafy Ovoid 
R. koreana 2 Present Pinkish Terete Entire, not enrolled Leafy Ovoid 
R. forrestiana 2 Present Red Terete Entire, not enrolled Leafy Ovoid 
Pimpinellifoliae clade and the other is consistent with p,, geography 


Cinnamomeae clade members particularly close to R. pendu- 
lina. Zhang et al. (2013) also suggested a close relationship be- 
tween R. spinosissima and R. pendulina but could not find any 
evidence of a relationship between R. spinosissima and other 
members of R. sect. Pimpinellifoliae. Instead, the same authors 
and Wissemann and Ritz (2005) seem to find that some individ- 
uals from these species are genetically close to R. sect. 
Caninae. This could be explained by some degree of hybridiza- 
tion between those two kinds of polyploids. 

Species of R. sect. Carolinae are embedded in the 
Cinnamomeae clade. Bruneau et al. (2007) discussed in detail 
the possible merging of R. sect. Carolinae in R. sect 
Cinnamomeae based on evidence from plastid DNA sequences 
but also morphological data (Lewis, 1957; Robertson, 1974), 
biochemical compounds (Grossi ef a/., 1998) and nuclear gene 
sequences (Joly et al., 2006). 

Conspecific individuals are not always monophyletic in the 
Cinnamomeae clade. This may be related to incomplete lineage 
sorting due to rapid speciation events or to high incidence of 
polyploids with frequent hybridization in this clade. Indeed, re- 
currently formed allopolyploids from different maternal spe- 
cies, as reported for North American allopolyploid species 
(Joly et al., 2006), could explain this pattern. Members of the 
Cinnamomeae clade show a highly variable ploidy level. They 
mostly have an even ploidy level (2-10) but a few species have 
been reported occasionally to have an uneven ploidy level (e.g. 
Rosa acicularis Lindl., R. pendulina and R. willmottiae Hemsl.; 
Roberts ef al., 2009). Alternatively, Joly (2012) has shown that 
it is not always possible to reject the null hypothesis of incom- 
plete lineage sorting when testing for hybridization between 
non-monophyletic North American species, suggesting that a 
variety of processes might explain these patterns. 

Polyploidy has been recognized as a prominent phenomenon 
in evolution and as an important cytogenetic mechanism in spe- 
ciation (Wood ef al., 2009). In the genus Rosa, hybridity is of- 
ten accompanied by polyploidy and it may have helped 
stabilize hybrids between distantly related species of the genus. 
Polyploidy may also have favoured the rate of diversification 
(R. sect. Caninae), an increase in the geographic range 
(R. acicularis, R. spinosissima) and the colonization of high-al- 
titude habitats (R. praelucens) or of high latitudes (R. acicula- 
ris). Ritz et al. (2011) suggested that the success of Caninae 
could be caused by its peculiar reproduction. 


Our results (Fig. 4) suggest that the genus Rosa probably 
evolved during the Eocene in Asia and western North America, 
with the Bering Land Bridge enabling genetic exchanges be- 
tween the two areas. This is supported by the previously cited 
oldest fossils of the genus Rosa, from Idaho and China (see the 
Materials and Methods), and by the presence of Rosa fossils in 
Alaska (see Becker, 1963; Hollick, 1936) from the Paleo- 
Eocene. At that time, the climate at those latitudes was temper- 
ate to warm temperate, which is consistent with the climate 
where most species of roses grow today. From Asia, approx. 
30 Ma, the Synstylae lineage and its allies extended their distri- 
bution westward into Europe. This period corresponds to the 
closure of the Turgai strait that would have facilitated this mi- 
gration. Following this western migration, part of the lineage 
extended its range into Eastern North America. The exchanges 
between Europe and Eastern North America persisted until the 
upper Miocene and were interrupted around 8 Ma. This is con- 
sistent with the hypothesis of Denk ef al. (2011) based on fos- 
sils from Iceland that the North Atlantic Land Bridge had been 
available until 9-8 Ma by providing a sub-aerial route with 
mild conditions. Rosaceae fossils are known in this area but dif- 
ficult to attribute to the genus Rosa. 

Our ancestral area reconstruction suggests that despite an 
early presence on the American continent, most extant 
American species are the results of a later re-colonization from 
Asia. Therefore, R. subgen. Hesperhodos could be considered 
as a relic of this early American presence of the genus Rosa. 
The ancestors of the Pimpinellifoliae and Cinnamomeae clades 
seem to have been exclusively Asian. 

The Cinnamomeae clade extended its distribution eastward, 
possibly through the Bering Land Bridge, into western North 
America then eastern North America. Exchanges between west- 
ern North America and eastern North America seem to persist 
even today. The chloroplast data suggest fairly recent (until 
early Pliocene) exchanges between Asia and western North 
America. This suggests that the Bering Land Bridge may have 
been available for a long time for Rosa despite the climatic 
cooling that occurred during the Miocene. The absence of east- 
ern North America—eastern Asia disjunction in our result con- 
trasts with results from other plants (Donoghue and Smith, 
2004). Those authors invoke greater extinction in western 
North America and Europe to explain eastern North 
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TABLE 5 Taxonomic modifications proposed based on our phylogenetic results 


Wissemann (2003) Tentative | 


Tentative 2 


Subgenus: Hulthemia 
Subgenus: Hesperhodos 
Subgenus: Platyrhodon 
Subgenus: Rosa 

Section: Bracteatae 
Section: Laevigatae 
Section: Banksianae 
Section: Pimpinellifoliae 
Section: Cinnamomeae 
Section: Carolinae 


Section: Synstylae: Treat as four sections: 


- American (R. setigera) R. setigera 

- European R. sempervirens 
(R. sempervirens, R. arvensis) R. arvensis 

- Asian 


Section: Indicae 
Section: Rosa 
Section: Caninae 


Sub-section: Rubrifoliae 
Sub-section: Vestitae 
Sub-section: Trachyphyllae 
Sub-section: Rubigineae 
Sub-section: Tomentellae 
Sub-section: Caninae 


Unchanged 


Create a nothosection 


Create a nothosection 


Could be best treated as a section 
Could be best treated as a section 


Would be best treated as a section (see Flora of China; Gu and Robertson, 2003) 


Unchanged 
Unchanged 
Unchanged 
Include members of our Pimpinellifoliae clade only 
Merge 


Merge 


Asian Synstylae (including Indicae) 


Consider as a nothosection 
(Synstylae x Cinnamomae) 
Create a sub-nothosection 


Create a sub-nothosection 


Not included (putative intersectional hybrids): 

R. spinosissima (Cinnamomae member x Pimpinellifoliae member?), 
R. abyssinica (Cinnamomae member x European Synstylae member?), 
R. praelucens (Cinnamomae member x Microphyllae member’), 

R. foetida (more data needed) 


Tentative | uses monophyletic groups from the chloroplast phylogeny. Tentative 2 considers the results from chloroplast and nuclear data for a synthetic 


approach. 


America—eastern Asia disjunctions. However, our results sug- 
gest that roses may have not suffered such extinction. 

European species of the Cinnamomeae clade seem to be the 
results of recent colonizations from Asia during the 
Pleistocene. The Caninae lineages diversification pre-dates 
those colonizations, which means it is unlikely that the 
Cinnamomeae type GAPDH sequences of species in sub-sect. 
Caninae and sub-sect. Rubigineae clades come from the 
European species of the Cinnamomeae clade. More probably, 
these Cinnamomeae type sequences come from previous ge- 
netic exchanges with Asian species of the Cinnamomeae clade 
during the latest Miocene or the Pliocene. 

Exchanges between Asia and Africa occurred during the late 
Miocene, at some time between 9-3 Ma and 6-3 Ma. This period 
is consistent with the closure of the Parathetys, which enabled 
exchanges (R6gl, 1999). It is possible that these exchanges re- 
sulted in the capture of a chloroplast genome from a member of 
Cinnamomeae clade by R. abyssinica, whose nuclear GAPDH 
sequence is close to that of European species of the Synstylae 
clade. Some of those (R. sempervirens L., R. phoenicea Boiss.) 
occur today in Lebanon (both species), Syria and Israel 
(R. phoenicia). 


Conclusions 


With the most comprehensively sampled and well resolved 
phylogeny of the genus Rosa, we provide an evolutionary 


framework that will prove useful for the study of this difficult 
genus. Our phylogenetic results of the genus Rosa allow us to 
redefine sections in the genus Rosa (Table 5); however, a 
proper revision of the genus is needed to name and typify the 
newly defined sections. For example, since Wissemann’s 
(2003) publication, The type for the genus Rosa is now in 
R. sect. Cinnamomeae, which means that this section should be 
named R. sect. Rosa. We suggest that the subgenera could be 
treated at the sectional level and that R. sects Bracteatae, 
Laevigatae and Banksianae could be left unchanged. Rosa sect. 
Pimpinellifoliae should include members of — our 
Pimpinellifoliae clade only, but this means that the type of the 
section would not be included. Rosa sects Cinnamomeae and 
Carolinae should be merged. There is a complex situation in 
the Synstylae clade due to the close relationship with the allo- 
polyploid R. sect. Caninae. This allopolyploid section could be 
treated as one or two nothosections. Rosa sect. Synstylae should 
include R. sect. Indicae and include or exclude non-Asian 
members. 

Our results also provide useful information for the studies of 
polyploids that are frequent in the genus. We formulate hypoth- 
eses about the origin of the genomes of several allopolyploids 
(R. sect. Caninae, R. praelucens and R. spinosissima) and dip- 
loid hybrids (R. abyssinica). 

Our ancestral area reconstruction suggests that despite an 
early presence on the American continent, most extant 
American species are the results of a later re-colonization from 
Asia probably through the Bering Land Bridge. Our results 
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suggest more recent exchanges between Asia and western 
North America than with eastern North America. The current 
distribution of roses from the Synstylae lineage in Europe is 
probably the result of a migration from Asia approx. 30 Ma 
ago, after the closure of the Turgai strait. 


SUPPLEMENTARY DATA 


Supplementary data are available online at www.aob.oxfordjour- 
nals.org and consist of the following. S1: list of samples used in 
this study with voucher information and GenBank accession 
numbers for the ¢rnL region, the psbA-trnH intergenic spacer, the 
tnG region and GAPDH sequences. S2: dispersal matrices used 
for the DEC ancestral area reconstructions used in the four time- 
slice model (A, C, D, E) or the two time-slice model (A, B). 
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APPENDIX 


The network representing the relationships among copies of 
GAPDH obtained from Rosa species excluding obvious 


allopolyploids. The names of other polyploids are in bold. A 
‘c’ followed by a number indicates the number attributed to 
one particular clone sequenced. The groups are compared 
with our main clades from the chloroplast analyses. Purple is 
attributed to Rosa subgen. Platyrhodon, yellow to R. sect. 
Banksianae, bright orange to R. sect. Bracteatae, brown to R. 
sect. Laevigatae, light pink to the Pimpinellifoliae clade, 
green to the Cinnamomeae clade and light blue to the 
Synstylae clade. Rosa abyssinica is highlighted with a light or- 
ange colour. Two types of copies, Cl and C2, are distin- 
guished in our Cinnamomeae group. 
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